The Disambiguation of Nominalizations
Abstract
This article addresses the interpretation of nominalizations, a particular class of compound nouns whose head noun is derived from a verb and whose modifier is interpreted as an argument of this verb. Any attempt to automatically interpret nominalizations needs to take into account: (a) the selectional constraints imposed by the nominalized compound head, (b) the fact that the relation of the modifier and the head noun can be ambiguous, and (c) the fact that these constraints can be easily overridden by contextual or pragmatic factors. The interpretation of nominalizations poses a further challenge for probabilistic approaches since the argument relations between a head and its modifier are not readily available in the corpus. Even an approximation that maps the compound head to its underlying verb provides insufficient evidence. We present an approach that treats the interpretation task as a disambiguation problem and show how we can "recreate" the missing distributional evidence by exploiting partial parsing, smoothing techniques, and contextual information. We combine these distinct information sources using Ripper, a system that learns sets of rules from data, and achieve an accuracy of 86.1% (over a baseline of 61.5%) on the British National Corpus.
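As a rough illustration of the disambiguation framing described in the abstract, the sketch below maps a compound head to its underlying verb and compares smoothed estimates for the subject and object readings of the modifier. Everything in it is an assumption made for illustration: the counts in `TUPLE_COUNTS` are invented, `HEAD_TO_VERB` and the function names are hypothetical, and simple add-alpha smoothing stands in for whichever smoothing techniques the article actually uses. It does not reproduce the article's model or its Ripper-based combination step.

```python
from collections import Counter

# Hypothetical toy counts of (verb, relation, noun) tuples, of the kind a
# partial parser might extract from a corpus. The numbers are invented.
TUPLE_COUNTS = Counter({
    ("protect", "obj", "child"): 12,
    ("protect", "subj", "child"): 1,
    ("behave", "subj", "student"): 7,
})

# Hypothetical mapping from a nominalized head to its underlying verb.
HEAD_TO_VERB = {
    "protection": "protect",
    "behaviour": "behave",
    "decision": "decide",
}

RELATIONS = ("subj", "obj")


def relation_probs(modifier, head, alpha=0.5):
    """Return add-alpha smoothed P(relation | verb, modifier) per relation."""
    verb = HEAD_TO_VERB[head]
    # Counter returns 0 for unseen tuples, so alpha keeps every estimate nonzero.
    smoothed = {r: TUPLE_COUNTS[(verb, r, modifier)] + alpha for r in RELATIONS}
    total = sum(smoothed.values())
    return {r: c / total for r, c in smoothed.items()}


def interpret(modifier, head, alpha=0.5):
    """Pick the more probable argument relation for a nominalized compound."""
    probs = relation_probs(modifier, head, alpha)
    # Ties resolve to the first entry in RELATIONS ("subj").
    return max(RELATIONS, key=lambda r: probs[r])


if __name__ == "__main__":
    print(interpret("child", "protection"))
    print(interpret("student", "behaviour"))
    print(relation_probs("committee", "decision"))
```

With these toy counts, an unseen pair such as "committee decision" receives equal probability for both readings. This is the kind of case in which verb-argument counts alone give no guidance and other evidence, such as context, would be needed.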
Related resources
Applying Constraints derived from the Context in the process of Incremental Sortal Specification of German ung-Nominalizations
Many German nominalizations with the affix -ung are sortally ambiguous. Within a sentence, lexico-semantic and/or syntactic phenomena may support disambiguation. The sortal interpretation of a nominalization may vary depending on the underlying syntactic analysis of one and the same, syntactically ambiguous sentence. We model the process of sortal disambiguation as a constraint-based incrementa...
The Automatic Interpretation of Nominalizations
This paper discusses the interpretation of nominalizations in domain-independent wide-coverage text. We present a statistical model which interprets nominalizations based on the co-occurrence of verb-argument tuples in a large balanced corpus. We propose an algorithm which treats the interpretation task as a disambiguation problem and achieves a performance of approximately 80% by combining part...
Dealing with sortal ambiguity of nominalizations by underspecification
Based on data from German -ung nominalizations, I argue that selection restriction tests are not suitable as linguistic tools for ontological disambiguation. Consequently, I question the significance of ontology as a starting point for linguistic theorizing. Instead, I argue for an underspecified account of the ontology of nominalizations, in which disambiguation loses its central role in the ...
-ung Nominalizations of Verbs of Saying in German Events and Propositions
-ung nominalizations of verbs of saying in German can be interpreted as events or propositions. There are context partners of such nominals which disambiguate the reading or suggest preferences. We consider a specific ambiguous constellation which is very frequent in German, where the nominal is the internal argument of a PP with nach. We detail the semantic representations of corresponding sen...
Approximating the disambiguation of some German nominalizations by use of weak structural, lexical and corpus information / Hacía la desambiguación de nominalizaciones en alemán a partir de información estructural, léxica y de corpus
Between classical symbolic word sense disambiguation (WSD) using explicit deep semantic representations of sentences and texts and statistical WSD using word co-occurrence information, there is a recent tendency towards mediating methods. Similar to so-called lightweight semantics (Marek, 2009), we suggest making only sparse use of semantic information. We describe an approximation model based ...
Journal title:
- Computational Linguistics
Volume 28
Publication date: 2002